SEQUENCE LISTING



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: THE SCRIPPS RESEARCH INSTITUTE 

(B) STREET: 10550 North Torrey Pines Road 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: US 

(F) ZIP: 92037 

(G) TELEPHONE: (619) 784-2937 

(H) TELEFAX: (619) 784-9399 

(ii) TITLE OF INVENTION: MHC CLASS II ANTIGEN PRESENTING SYSTEMS 



(iii) NUMBER OF SEQUENCES: 56 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US97/ 

(B) FILING DATE: 22-MAY-1997

(C) CLASSIFICATION: 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/018,175 

(B) FILING DATE: 23-MAY-1996

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



ATTCGATGCA CACTCACATT CTTCTCCTAA TACGATAATA AAACTTTCCA TGAAAAATAT 60

GGAAAAATAT ATGAAAATTG AGAAATCCAA AAAACTGATA AACGCTCTAC TTAATTAAAA 120

TAGATAAATG GGAGCGGCTG GAATGGCGGA GCATGACCAA GTTCCTCCGC CAATCAGTCG 180

TAAAACAGAA GTCGTGGAAA GCGGATAGAA AGAATGTTCG ATTTGACGGG CAAGCATGTC 240

TGCTATGTGG CGGATTGCGG AGGAATTGCA CTGGAGACCA GCAAGGTTCT CATGACCAAG 300

AATATAGCGG TGTGAGTGAG CGGGAAGCTC GGTTTCTGTC CAGATCGAAC TCAAAACTAG 360

TCCAGCCAGT CGCTGTCGAA ACTAATTAAG TTAATGAGTT TTTCATGTTA GTTTCGCGCT 420

GAGCAACAAT TAAGTTTATG TTTCAGTTCG GCTTAGATTT CGCTGAAGGA CTTGCCACTT 480

TCAATCAATA CTTTAGAACA AAATCAAAAC TCATTCTAAT AGCTTGGTGT TCATCTTTTT 540

TTTTAATGAT AAGCATTTTG TCGTTTATAC TTTTTATATT TCGATATTAA ACCACCTATG 600

AAGTTCATTT TAATCGCCAG ATAAGCAATA TATTGTGTAA ATATTTGTAT TCTTTATCAG 660

GAAATTCAGG GAGACGGGGA AGTTACTATC TACTAAAAGC CAAACAATTT CTTACAGTTT 720

TACTCTCTCT ACTCTAGAGT 740



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



AATTCGTTGC AGGACAGGAT GTGGTGCCCG ATGTGACTAG CTCTTTGCTG CAGGCCGTCC 60

TATCCTCTGG TTCCGATAAG AGACCCAGAA CTCCGGCCCC CCACCGCCCA CCGCCACCCC 120

CATACATATG TGGTACGCAA GTAAGAGTGC CTGCGCATGC CCCATGTGCC CCACCAAGAG 180

TTTTGCATCC CATACAAGTC CCCAAAGTGG AGAACCGAAC CAATTCTTCG CGGGCAGAAC 240 

AAAAGCTTCT GCACACGTCT CCACTCGAAT TTGGAGCCGG CCGGCGTGTG CAAAAGAGGT 300 

GAATCGAACG AAAGACCCGT GTGTAAAGCC GCGTTTCCAA AATGTATAAA ACCGAGAGCA 360 

TCTGGCCAAT GTGCATCAGT TGTGGTCAGC AGCAAAATCA AGTGAATCAT CTCAGTGCAA 420 

CTAAAGG 427 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTTGAATTCC ACCATGCCGT GCAGCAGAGC TCTGA 35 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TTTGGATCCT CATAAAGGCC CTGGGTGTC 29

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CTTGAATTCC ACCATGGCTC TGCAGATCCC CA 32

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

TTTGGATCCT CACTGCAGGA GCCCTGCT 28

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



GCGTTGCAGG ACAGGATGTG GTGCCCGATG TGACTAGCTC TTTGCTGCAG GCCGTCCTAT 60

CCTCTGGTTC CGATAAGAGA CCCAGAACTC CGGCCCCCCA CCGCCCACCG CCACCCCCAT 120

ACATATGTGG TACGCAAGTA AGAGTGCCTG CGCATGCCCC ATGTGCCCCA CCAAGAGTTT 180

TGCATCCCAT ACAAGTCCCC AAAGTGGAGA ACCGAACCAA TTCTTCGCGG GCAGAACAAA 240

AGCTTCTGCA CACGTCTCCA CTCGAATTTG GAGCCGGCCG GCGTGTGCAA AAGAGGTGAA 300

TCGAACGAAA GACCCGTGTG TAAAGCCGCG TTTCCAAAAT GTATAAAACC GAGAGCATCT 360

GGCCAATGTG CATCAGTTGT GGTCAGCAGC AAAATCAAGT GAATCATCTC AGTGCAACTA 420

AAGGGGGGAA TTCCTGCAGA GACCTCCCAG AGACCAGGAT GCCGTGCAGC AGAGCTCTGA 480

TTCTGGGGGT CCTCGCCCTG AACACCATGC TCAGCCTCTG CGGAGGTGAA GACGACATTG 540

AGGCCGACCA CGTAGGCTTC TATGGTACAA CTGTTTATCA GTCTCCTGGA GACATTGGCC 600

AGTACACACA TGAATTTGAT GGTGATGAGT TGTTCTATGT GGACTTGGAT AAGAAGAAAA 660

CTGTCTGGAG GCTTCCTGAG TTTGGCCAAT TGATACTCTT TGAGCCCCAA GGTGGACTGC 720

AAAACATAGC TGCAGAAAAA CACAACTTGG GAATCTTGAC TAAGAGGTCA AATTTCACCC 780

CAGCTACCAA TGAGGCTCCT CAAGCGACTG TGTTCCCCAA GTCCCCTGTG CTGCTGGGTC 840

AGCCCAACAC CCTTATCTGC TTTGTGGACA ACATCTTCCC ACCTGTGATC AACATCACAT 900

GGCTCAGGAA TAGCAAGTCA GTCACAGACG GCGTTTATGA GACCAGCTTC CTCGTCAACC 960

GTGACCATTC CTTCCACAAG CTGTCTTATC TCACCTTCAT CCCTTCTGAT GATGACATTT 1020

ATGACTGCAA GGTGGAGCAC TGGGGCCTGG AGGAGCCGGT TCTGAAACAC TGGGAACCTG 1080

AGATTCCAGC CCCCATGTCA GAGCTGACAG AAACTGTGGT GTGTGCCCTG GGGTTGTCTG 1140

TGGGCCTTGT GGGCATCGTG GTGGGCACCA TCTTCATCAT TCAAGGCCTG CGATCAGGTG 1200

GCACCTCCAG ACACCCAGGG CCTTTATGAG TCACACCCTG GAAAGGAAGG TGTGTGTCCC 1260

TCTTCATGGA AGAAGTGGTG TTCTGGGTGT CGAATTCGAG CTCGGTACCC GGGGATCCTC 1320

TAGAGTCGAC CTGCAGGCAT GCAATTCGAT GCACACTCAC ATTCTTCTCC TAATACGATA 1380

ATAAAACTTT CCATGAAAAA TATGGAAAAA TATATGAAAA TTGAGAAATC CAAAAAACTG 1440

ATAAACGCTC TACTTAATTA AAATAGATAA ATGGGAGCGG CAGGAATGGC GGAGCATGGC 1500

CAAGTTCCTC CGCCAATCAG TCGTAAAACA GAAGTCGTGG AAAGCGGATA GAAAGAATGT 1560

TCGATTTGAC GGGCAAGCAT GTCTGCTATG TGGCGGATTG CGGAGGAATT GCACTGGAGA 1620

CCAGCAAGGT TCTCATGACC AAGAATATAG CGGTGAGTGA GCGGGAAGCT CGGTTTCTGT 1680

CCAGATCGAA CTCAAAACTA GTCCAGCCAG TCGCTGTCGA AACTAATTAA GTAAATGAGT 1740

TTTTCATGTT AGTTTCGCGC TGAGCAACAA TTAAGTTTAT GTTTCAGTTC GGCTTAGATT 1800

TCGCTGAAGG ACTTGCCACT TTCAATCAAT ACTTTAGAAC AAAATCAAAA CTCATTCTAA 1860

TAGCTTGGTG TTCATCTTTT TTTTTAATGA TAAGCATTTT GTCGTTTATA CTTTTTATAT 1920

TTCGATATTA AACCACCTAT GAAGTTCATT TTAATCGCCA GATAAGGAAT ATATTGTGTA 1980

AATATTTGTA TTCTTTATCA GGAAATTCAG GGAGACGGGG AAGTTACTAT CTACTAAAAG 2040

CCAAACAATT TCTTACAGTT TTACTCTCTC TACTCTAGAG CTTGGCACTG GCCGTCGTTT 2100

TACAACGTCG TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCAGATC 2160

CCCCTTTCGC CAGCTGGCGT AATAGCGAAG AGGCGCGCAC CGATCGGCCT TCCCAACAGT 2220

TGCGCAGCCT GAATGGCGAA TGGCGCCTGA TGCGGTATTT TCTCCTTACG CATCTGTGCG 2280

GTATTTCACA CCGCATATGG TGCACTCTCA GTAGAATCTG CTCTGATGCC GCATAGTTAA 2340

GCCAGCCCCG ACACCCGCCA ACACCCGCTG ACGCGCCCTG ACGGGGTTGT CTGCTCCCGG 2400

CATCCGCTTA CAGACAAGCT GTGACCGTCT CGGGGAGCTG CATGTGTCAG AGGTTTTCAC 2460

CGTCATCACC GAAACGCGCG AGACGAAAGG GCCTCGTGAT ACGGCTATTT TTATAGGTTA 2520

ATGTCATGAT AATAATGGTT TCTTAGACGT CAGGTGGGAC TTTTGGGGGA AATGTGCGCG 2580

GAACCCCTAT TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT 2640

AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCC 2700

GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTTTGCT CACCCAGAAA 2760

CGCTGGTGAA AGTAAAAGAT GCTGAACATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC 2820

TGGATCTCAA CAGCGGTAAG ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA 2880

TGAGCACTTT TAAAGTTCTG CTATGTGGCG CGGTATTATC CGGTATTGAC GCCGGGCAAG 2940

AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC TCACGAGTGA 3000

CAGAAAAGGA TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTGCT GCGATAACCA 3060

TGAGTGATAA CACTGCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGCTAA 3120

CCGCTTTTTT GCACAACATG GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC 3180

TGAATGAAGC CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGTAGCA ATGGCAACAA 3240

CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG 3300

ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG CTCGGCCCTT CCGGCTGGCT 3360

GGTTTATTGC TGATAAATCT GGAGCCGGTG AGCGTGGGTG TCGGGGTATC ATTGGAGCAC 3420

TGGGGCCAGA TGGTAAGCCC TCCGGTATCG TAGTTATCTA CACGACGGGG AGTCAGGCAA 3480

CTATGGATGA ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT AAGGATTGGT 3540

AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT CATTTTTAAT 3600

TTAAAAGGAT CTAGGTGAAG ATCCTTTTTG ATAATCTCAT GACCAAAATC CCTTAACGTG 3660

AGTTTTCGTT CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC 3720

CTTTTTTTCT GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACGGCTA CCAGCGGTGG 3780

TTTGTTTGCC GGATCAAGAG CTACCAACTC TTTTTCCGAA GGTAACTGGC TTCAGCAGAG 3840

CGCAGATACC AAATACTGTC CTTCTAGTGT AGCCGTAGTT AGGCCACCAC TTCAAGAACT 3900

CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT ACCAGTGGCT GCTGCCAGTG 3960

GCGATAAGTC GTGTCTTACC GGGTTGGACT GAAGACGATA GTTACCGGAT AAGGCGCAGC 4020

GGTCGGGCTG AACGGGGGGT TCGTGCACAC AGCCGAGCTT GGAGCGAACG AGCTACACCG 4080

AACTGAGATA CCTACAGCGT GAGCATTGAG AAAGCGCCAC GCTTCCCGAA GGGAGAAAGG 4140

CGGACAGGTA TCCGGTAAGC GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG 4200

GGGGAAACGC CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC 4260

GATTTTTGTG ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT 4320

TTTTACGGTT CCTGGCCTTT TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC 4380

CTGATTCTGT GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC 4440

GAAGGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCCA ATACGCAAAC 4500 

CGCCTCTCCC CGCGCGTTGG CCGATTCATT AATGCAGCTG GCACGACAGG TTTCCCGACT 4560 

GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAGTTA GCTCACTCAT TAGGCACCCC 4620 

AGGCTTTACA CTTTATGCTT CCGGCTCGTA TGTTGTGTGG AATTGTGAGC GGATAACAAT 4680 

TTCACACAGG AAACAGCTAT GACCATGATT ACG 4713 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GCGTTGCAGG ACAGGATGTG GTGCCCGATG TGACTAGCTC TTTGCTGCAG GCCGTCCTAT 60 

CCTCTGGTTC CGATAAGAGA CCCAGAACTC CGGCCCCCCA CCGCCCACCG CCACCCCCAT 120 

ACATATGTGG TACGCAAGTA AGAGTGCCTG CGCATGCCCC ATGTGCCCCA CCAAGAGTTT 180 

TGCATCCCAT ACAAGTCCCC AAAGTGGAGA ACCGAACCAA TTCTTCGCGG GCAGAACAAA 240 

AGCTTCTGCA CACGTCTCCA CTCGAATTTG GAGCCGGCCG GCGTGTGCAA AAGAGGTGAA 300 

TCGAACGAAA GACCCGTGTG TAAAGCCGCG TTTCCAAAAT GTATAAAACC GAGAGCATCT 360 

GGCCAATGTG CATCAGTTGT GGTCAGCAGC AAAATCAAGT GAATCATCTC AGTGCAACTA 420 

AAGGGGGGAA TTCCCTGCTG TGCCCTAGAG ATGGCTCTGC AGATCCCCAG CCTCCTCCTC 480 

TCAGCTGCTG TGGTGGTGCT GATGGTGCTG AGCAGCCCAG GGACTGAGGG CGGAAACTCC 540 

GAAAGGCATT TCGTGGTCCA GTTCAAGGGC GAGTGCTACT ACACCAACGG GACGCAGCGC 600 

ATACGGCTCG TGACCAGATA CATCTACAAC CGGGAGGAGT ACGTGCGCTA CGACAGCGAC 660 

GTGGGCGAGT ACCGCGCGGT GACCGAGCTG GGGCGGCCAG ACGCCGAGTA CTGGAACAGC 720

CAGCCGGAGA TCCTGGAGCG AACGCGGGGC GAGGTGGACA CGGCGTGCAG ACACAACTAC 780

GAGGGGCCGG AGACCAGCAC CTCCCTGCGG CGGCTTGAAC AGCCCAATAT CGCCATCTCC 840

CTGTCCAGGA CAGAGGCCCT CAACCACCAC AACACTCTGG TCTGTTCGGT GAGAGATTTC 900

TACCCAGCCA AGATCAAAGT GCGCTGGTTC AGGAATGGCC AGGAGGAGAC AGTGGGGGTC 960

TCATCGAGAC AGCTTATTAG GAATGGGGAC TGGACCTTCC AGGTCCTGGT CATGCTGGAG 1020

ATGACCCCTC ATCAGGGAGA GGTCTACACC TGCCATGTGG AGCATCCCAG CCTGAAGAGC 1080

CCCATCACTG TGGAGTGGAG GGCACAGTCC GAGTCTGCCC GGAGCAAGAT GTTGAGCGGC 1140

ATCGGGGGCT GCGTGCTTGG GGTGATCTTC CTCGGGCTCG GCCTTTTCAT CCGTCACAGG 1200

AGTCAGAAAG GACCTCGAGG CCCTCCTCCA GCAGGGCTCC TGCAGTGACT CAGAGTGTTT 1260

TGACTCAGTT GACTGTCTCA GACTGTAAGA CCTACATGTC TCGAATTCGA GCTCGGTAGC 1320

CGGGGATCCT CTAGAGTCGA CCTGCAGGCA TGCAATTCGA TGCACACTCA CATTCTTCTC 1380

CTAATACGAT AATAAAACTT TCCATGAAAA ATATGGAAAA ATATATGAAA ATTGAGAAAT 1440

CCAAAAAACT GATAAACGCT CTACTTAATT AAAATAGATA AATGGGAGCG GCAGGAATGG 1500

CGGAGCATGG CCAAGTTCCT CCGGCAATCA GTCGTAAAAC AGAAGTCGTG GAAAGCGGAT 1560

AGAAAGAATG TTCGATTTGA CGGGGAAGCA TGTCTGCTAT GTGGCGGATT GCGGAGGAAT 1620

TGCACTGGAG ACCAGCAAGG TTCTCATGAC CAAGAATATA GCGGTGAGTG AGCGGGAAGC 1680

TCGGTTTCTG TCCAGATCGA ACTCAAAACT AGTCCAGCCA GTCGCTGTCG AAACTAATTA 1740

AGTAAATGAG TTTTTCATGT TAGTTTCGCG CTGAGCAACA ATTAAGTTTA TGTTTCAGTT 1800

CGGCTTAGAT TTCGCTGAAG GACTTGCCAC TTTCAATCAA TACTTTAGAA CAAAATCAAA 1860

ACTCATTCTA ATAGCTTGGT GTTCATCTTT TTTTTTAATG ATAAGCATTT TGTCGTTTAT 1920

ACTTTTTATA TTTCGATATT AAACCACCTA TGAAGTTCAT TTTAATCGCC AGATAAGCAA 1980

TATATTGTGT AAATATTTGT ATTCTTTATC AGGAAATTCA GGGAGACGGG GAAGTTACTA 2040

TCTACTAAAA GCCAAACAAT TTCTTAGAGT TTTACTCTCT CTACTCTAGA GCTTGGCACT 2100

GGCCGTCGTT TTACAACGTC GTGACTGGGA AAACCCTGGC GTTACCCAAC TTAATCGCCT 2160

TGCAGCACAT CCCCCTTTCG CCAGCTGGCG TAATAGCGAA GAGGCCCGCA CCGATCGCCC 2220

TTCCCAACAG TTGCGCAGCC TGAATGGCGA ATGGCGCCTG ATGCGGTATT TTCTCCTTAC 2280

GCATCTGTGC GGTATTTCAC ACCGCATATG GTGCACTCTC AGTACAATCT GCTCTGATGC 2340

CGCATAGTTA AGCCAGCCCC GAGACCCGCC AACACCCGCT GACGCGCCCT GACGGGCTTG 2400

TCTGCTCCCG GGATCCGCTT ACAGACAAGC TGTGACCGTC TCCGGGAGCT GCATGTGTCA 2460

GAGGTTTTCA CCGTCATCAC CGAAACGCGC GAGACGAAAG GGCCTCGTGA TACGCCTATT 2520

TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG TCAGGTGGCA CTTTTCGGGG 2580

AAATGTGCGC GGAACCCCTA TTTGTTTATT TTTCTAAATA CATTCAAATA TGTATCCGCT 2640

CATGAGACAA TAACCCTGAT AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT 2700

TCAACATTTC CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 2760

TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG CACGAGTGGG 2820

TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG AGTTTTCGCC CCGAAGAACG 2880

TTTTCCAATG ATGAGCACTT TTAAAGTTCT GCTATGTGGC GCGGTATTAT CCCGTATTGA 2940

CGCCGGGCAA GAGCAACTCG GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA 3000

CTCAGCAGTC ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 3060

TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA TCGGAGGACC 3120

GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT GTAACTCGCC TTGATCGTTG 3180 

GGAACCGGAG CTGAATGAAG CCATACCAAA CGACGAGCGT GACACCACGA TGCCTGTAGC 3240 

AATGGCAACA ACGTTGCGCA AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA 3300 

ACAATTAATA GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 3360 

TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT CTCGCGGTAT 3420 

CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC GTAGTTATCT ACACGACGGG 3480 

GAGTCAGGCA ACTATGGATG AACGAAATAG ACAGATCGCT GAGATAGGTG CCTCACTGAT 3540 

TAAGCATTGG TAACTGTCAG ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT 3600 

TCATTTTTAA TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 3660 

CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTGAGACCCC GTAGAAAAGA TCAAAGGATC 3720 

TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG CAAACAAAAA AACCACCGCT 3780 

ACCAGCGGTG GTTTGTTTGC CGGATCAAGA GCTACCAACT CTTTTTCCGA AGGTAACTGG 3840 

CTTCAGCAGA GCGCAGATAC CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA 3900 

CTTCAAGAAC TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 3960 

TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT AGTTACCGGA 4020 

TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA CAGCCCAGCT TGGAGCGAAC 4080 

GACCTACACC GAACTGAGAT ACCTACAGCG TGAGCATTGA GAAAGCGCCA CGCTTCCCGA 4140 

AGGGAGAAAG GCGGACAGGT ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG 4200 

GGAGCTTCCA GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 4260 

ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA AAAACGCCAG 4320 

CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT TTTGCTCAGA TGTTCTTTCC 4380 

TGCGTTATCC CCTGATTCTG TGGATAACCG TATTACCGCC TTTGAGTGAG CTGATACCGC 4440 

TCGGCGCAGC CGAACGACCG AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC 4500 

AATACGCAAA CCGCCTCTCC CCGCGCGTTG GCCGATTGAT TAATGCAGCT GGCACGACAG 4560 

GTTTGCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT AGCTGACTCA 4620 

TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT ATGTTGTGTG GAATTGTGAG 4680 

CGGATAACAA TTTCACACAG GAAACAGCTA TGACCATGAT TACG 4724 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

CCACCATGGC CATTAGTGGA GTC 23 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTTGGATCCT TACAGAGGCC CCCTGCGTT 29 

(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CCACCATGGT GTGTCTGAGG CTCC 24

(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TTTGGATCCT CAGCTCAGGA ATCCTCTTG 29 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCACCATGGT CCTAAACAAA GCTCTGAT 28 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTTGGATCCT CACAAGGGCC CTTGGTGTCT 30 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCACCATGGC TTGGAAGAAG GCCTTT 26 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTTAGATCTC AGTGCAGAAG CCCTTT 26 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCACCATGGG CCCTGAAGAC AGAAT 25 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

TTTGGATCCT CACAGGGTCC CCTGGGC 27

(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CCACCATGGT TCTGCAGGTT TCTGCG 26

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

TTTGGATCCT TATGCAGATC CTCGTTGAA 29

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AAGAATTCAC TAGAGGCTAG AGCCAT 26 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AAGGATCCTC ACAGGGTGAC TTGACC 26 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:



GCGTTGCAGG ACAGGATGTG GTGCCCGATG TGACTAGCTC TTTGCTGCAG GCCGTCCTAT 60

CCTCTGGTTC CGATAAGAGA CCCAGAACTC CGGCCCCCCA CCGCCCACCG CCACCCCCAT 120

ACATATGTGG TACGCAAGTA AGAGTGCCTG CGCATGCCCC ATGTGCCCCA CCAAGAGTTT 180

TGCATCCCAT ACAAGTCCCC AAAGTGGAGA ACCGAACCAA TTCTTCGCGG GCAGAACAAA 240

AGCTTCTGCA CACGTCTCCA CTCGAATTTG GAGCCGGCCG GCGTGTGCAA AAGAGGTGAA 300

TCGAACGAAA GACCCGTGTG TAAAGCCGGG TTTCCAAAAT GTATAAAACC GAGAGCATCT 360

CGnf^AATRTn 

O V3 V> OXU\ J. VJ J. VJ 


GATGAGTTGT 

OA X OfW7 X X O X 


X O AO O AO o 


AAAATCAAGT 


GAATCATCTC 


AGTGCAACTA 




AAfinCfiGGAA 

£U\U O O O U 


X X OOAX O Xf\0 


AcnnTAfiAnr 

AOO O X AOAO O 


CATGGATGAC 


CAACGCGACC 


TCATCTCTAA 

X X w X X AXtX 


480 


nrATnAnriAA 


X XOOOOrLXAO 


X OOOOAAOOU 


CCCTAGAGAG 


CCAGAAAGGT 


GCAGCCGTGG 


540 




C\\J OUUXVjXXX 


O XoXOO J.OO L 


GGCTCTGCTC 


TTGGCTGGGC 


AGGCCACCAC 


600 


TGHTTACTTH 


O X O X AOOAVJO 


AAC.AQC^c^rrn 

AAOAOOO O O V7 


CCTAGACAAG 


CTGACCATCA 


CCTCCCAGAA 


ooU 


rrTHfAArTr; 

X uurVnO X u 


O/WjAVjOO X X O 


\J L»A i O AAVI U i 


TCCGAAATCT 


GCCAAACCTG 


TGAGCCAGAT 


~i r\ r\ 

720 


nrnfiATfinnT 

uouurixooo X 


AO XOOOX XOO 


Tr;ATnrr:Trr 

XoAiowo X OL> 


AATGTCCATG 


GATAACATGC 


TCCTTGGGCC 


-T o r\ 

780 


TnTHAAfiAAr 


flTTArPA AHT 

o X XAOOAAO X 


ACCinrAACAT 

aOoOOAAOaX 


GACCCAGGAC 


CATGTGATGC 


ATCTGCTCAC 


84U 


GAGGTCTGGA 


O O O O X VJ vA w X 


ACCC.C.C. AQCT 

AOOOOOAOO X 


GAAGGGGACC 


TTCCCAGAGA 


ATCTGAAGCA 


yuu 


TCTTAAGAAC 


X OOAX OOAXO 


O OO X OAAO X O 


GAAGATCTTC 


GAGAGCTGGA 


TGAAGCAGTG 


Q 

you 


nJ^^XOXXVJXXX 


r^Af^AT^AfiTA 

V7 AuA X O AO OA 


AfiAArTrrpT 

AOAAO X OOO X 


GGAGGAGAAG 


AAGCCCACAG 


AGGCTCCACC 


1 nor» 


TAAAGAGCCA 


CTGGACATfifi 

O X OO AO A X \3\J 


AAGAncTATr: 

AAVJ AO O X A X O 


TTCTGGCCTG 


GGAGTGACCA 


GGCAGGAACT 




GGGTCAAGTC 


AGnnTGTGAA 


GAGAGAfinrn 

OA O AO AOO O O 


AGCATCAAGC 


TTATCGATAC 


CGTCGACCTG 


1 1 /■ n 


CAGGCATGCA 


ATTCGATf^CA 

AX XOOAXOOA 


CACTCACATT 

OAO X OAOAX X 


CTTCTCCTAA 


TACGATAATA 


AAACTTTCCA 


IZUU 


TGAAAAATAT 


GGAAAAATAT 

uVJAAAAAX AX 


ATGAAAATTn 

A X O AAAA X X O 


AGAAATCCAA 


AAAACTGATA 


AACGCTCTAC 


izou 


TTAATTAAAA 


TAGATAAATG 

X AO A X AAA X \J 


OOAOOOO OAO 


GAATGGCGGA 


GCATGGCCAA 


GTTCCTCCGC 


1 ion 


CAATCAGTCG 


TAAAAHAfiAA 

X AAAAO A V7 AA 


CTCHTCCAAA 

O X OO X OOAAA 


GCGGATAGAA 


AGAATGTTCG 


ATTTGACGGG 


lion 


CAACnATGTn 


X OO XAX O XOw 


OouAx XoOoo 


AuvjAAX XVjUA 


CTGGAGACCA 


GCAAGGTTCT 


1 /. /. n 


CATGACGAAfi 


AATATAfimf^ 

AA X A X AO O O V7 


X V7AO X OAO OO 


GGAAGCTCGG 


TTTCTGTCCA 


GATCGAACTC 


1 


AAAAGTAGTr 


O AU O O AO X u O 


^ X O X UOAAAO 


TAATTAAGTA 


AATGAGTTTT 


TCATGTTAGT 


1 joU 


TTGGnGflTf^A 
X xouooo xori 


nrAATAATTA 

OOAAOAAX XA 


AO XXXaXoXX 


TCAGTTCGGC 


TTAGATTTCG 


CTGAAGGACT 


ioZU 


TGCCACTTTC AATCAATACT TTAGAACAAA ATCAAAACTC ATTCTAATAG CTTGGTGTTC 1680

ATCTTTTTTT TTAATGATAA GCATTTTGTC GTTTATACTT TTTATATTTC GATATTAAAC 1740

CACCTATGAA 


GTTCATTTTA 


ATCGCCAGAT 




TTGTGTAAAT 


ATTTGTATTC 


1800 


TTTATCAGGA AATTCAGGGA GACGGGGAAG TTACTATCTA CTAAAAGCCA AACAATTTCT 1860

TACAGTTTTA CTCTCTCTAC TCTAGAGCTT GGCACTGGCC GTCGTTTTAC AACGTCGTGA 1920

CTGGGAAAAC CCTGGCGTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG 1980

CTGGCGTAAT AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 2040

TGGCGAATGG CGCCTGATGC GGTATTTTCT CCTTACGCAT CTGTGCGGTA TTTCACACCG 2100

CATATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGCCCCGACA 2160

CCCGCCAACA CCCGCTGACG CGCCCTGACG GGCTTGTCTG CTCCCGGCAT CCGCTTACAG 2220 

ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA 2280 

ACGCGCGAGA CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT 2340 

AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG 2400 

TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT 2460 

GCTTCAATAA TATTGAAAAA GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT 2520 

TCCCTTTTTT GCGGCATTTT GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT 2580 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AAGAATTCAC CATGGATGAT CAGCGCGACC TT 32 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

AAAGGATCCT CACATGGGGA CTGGGCCCAG A 31

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

AAACCATGGG TCATGAACAG AACCA 25

(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TTTGTCGACT CAGTCACCTG AGCAAGG 27

(2) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

AAACCATGGT CTCATTCCTG CC 22 

(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TTTGTCGACC TAGGAAATGT GCCATCC 27 

(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



wo 97/46: 



1» 



107 



PCT/US97/08697 



(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TTTAGAATTC ACCATGGCTT CAACCCGTGC CAAG 34



(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

TTTAGTCGAC TCAGGGAGGT GGGGCTTGTC C 31 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

ACCCTTGAAT TCATGGCTCC CAGCAGCCCC CGGCCC 36

(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATTACCGGAT CCTCAGGGAG GCGTGGCTTG TGTGTTCGG 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
AAGGTACCCG TGGAGACTGC CAGAGAT 27



(2) INFORMATION FOR SEQ ID NO: 35:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TTTGGATCCC TATGGCCGGA AGGCCTG 27 







(2) INFORMATION FOR SEQ ID NO: 36:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

AAGAATTCCT GTCAGAATGG CCACCAT 27



(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 28 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

TTTAGATCTT CACTCAGCTC TGGACGGT 28

(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

ACCCTTGAGC TCATGGTTGC TGGGAGCGAC GCGGGG 36

(2) INFORMATION FOR SEQ ID NO: 39:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

ATTACCGGAT CCTTAAAGAA CATTCATATA CAGCACAATA CA 42



(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TTTAGAATTC ACCATGGCTT GCAATTGTCA GTTG 34 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TTTAGTCGAC CTAAAGGAAG ACGGTCTGTT C 31 



(2) INFORMATION FOR SEQ ID NO: 42:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

ACCCTTGAAT CCATGGGCCA CACACGGAGG CAG 33 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

ATTACCGGAT CCTTATACAG GGCGTACACT TTCCCTTCT 39 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

TTTAGAATTC ACCATGGACC CCAGATGCAC CATGGG 36



(2) INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TTTAGTCGAC TCACTCTGCA TTTGGTTTTG CTGA 34 

(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

ACCCTTGAGC TCATGGATCC CCAGTGCACT ATG 33

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

ATTACCCCCG GGTTAAAAAC ATGTATCACT TTTGTCGCAT GA 42 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

AAAGGATCCA CCATGCAGCA GCCCTTCAAT T 31

(2) INFORMATION FOR SEQ ID NO: 49:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

TTTGGATCCT TAGAGCTTAT ATAAGCCGA 29

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

AAAGAATTCG GTACCATGCC GGAGGAGGGT TCGG 34

(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

TTTGGATCCT CAGGGGCGCA CCCACTGCA 29 

(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ile Ser Gln Ala Val His Ala Ala His Ala Glu Ile Asn Glu Ala Gly
1               5                   10                  15

Arg 

(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

Pro Lys Tyr Val Lys Gln Asn Thr Leu Lys Leu Ala Thr
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Lys Thr Ile Ala Thr Asp Glu Glu Ala Arg Arg
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gln Ala Ser Leu Ala Leu Ser Tyr Arg Leu Asn Met Phe Thr Pro
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 56:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Phe Val Arg Phe Asp Ser Asp Ala Ala Ser Gln Arg Met
1               5                   10



